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From behavioral sciences to biology to quantum mechanics, one encounters situations where (i) a system 
outputs several random variables in response to several inputs, (ii) for each of these responses only some of the 
inputs may "directly" influence it, but (iii) other inputs provide a "context" for this response by influencing its 
probabilistic relations to other responses. These contextual influences are very different, say, in the classical 
kinetic theory and in the entanglement paradigm of quantum mechanics, traditionally being interpreted as 
representing different forms of physical determinism. We show how one can quantify and classify all logically 
possible contextual influences, based on probabilistic coupling: studying sets of joint distributions imposed on 
random outputs recorded at different (mutually incompatible) values of inputs. 

Keywords: Bell/CHSH inequalities; context; coupling; determinism; EPR paradigm; selective 
influences. 



1. INTRODUCTION 

Consider a system with two inputs, a, /3, and 
two random outputs, A, B, about which it is as- 
sumed that A is not influenced by /3, nor B by 
a. A necessary condition for this selectivity of 
influences assumption is marginal selectivity Q|: 
changes in the value of /3 do not influence the dis- 
tribution of A, and analogously for a and B. In 
physics this condition is known under a variety 
of other names The assumption of selective 
influences, however, is stronger. It requires the 
representability of two outputs as 



A = f(R,a),B = g(R,p), 



(1.1) 



where /, g are some functions and R a source 
of randomness that does not depend on a, (3 [#|- 
[£| . Such a representability may or may not exist 
when marginal selectivity is satisfied. According 
as which is the case, one can think of a and /? 
as being involved in different kinds of probabilis- 
tic context for the dependence of, respectively, B 
on (3 and A on a. Equation (1.1) is tradition- 



ally viewed as indicating lack of contextual in- 
fluences (in physics, classical determinism), but 
one can still think of various special cases of this 
"no-context" contextuality. Analogously, one can 



think of various kinds of violations of ( 1.1 1 



whether within or outside the scope of ( 1.1 1. Our 



We propose a principled way of quantifying 
and classifying conceivable contextual influences, 



approach is neutral with respect to such issues as 
causality or what distinguishes direct influences 
from contextual. We merely accept as a given 
a diagram of input-output correspondences (e.g., 
A <— a, B ■<— (3) and study the joint distribution 
of the outputs at all possible values of the in- 
puts. The interpretation of the diagram is irrele- 
vant insofar as it is compatible with the observed 
pattern of marginal selectivity. Our approach is 
maximally general in the sense of applying to ar- 
bitrary sets of inputs and outputs (see Appendix 
A). To demonstrate it by detailed computations, 
however, we confine ourselves here to binary a, (3 
influencing binary A, B, and even more narrowly, 
to the "homogeneous" case with the two values of 
both A and B equiprobable at all values of the 
inputs. Marginal selectivity then is satisfied triv- 
ially with respect to any diagram of input-output 
correspondences, as all outcomes have probability 
x ji irrespective of a,j3. 

The example focal for this paper is Bohm's 
version of the Einstein-Podolsky-Rosen paradigm 
(EPR/B) [7o|: a quantum mechanical (QM) sys- 
tem consisting (in the simplest case) of two en- 
tangled spin-!/2 particles separated by a space- 
like interval. The two inputs here are spin mea- 
surements on these particles: input a has two 
values corresponding to spin axes ct\,ai chosen 
for one particle, and input j3 has two values cor- 
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responding to spin axes j3i , /?2 for another parti- 
cle. Note that the spin measurements on a given 
particle along two different axes are noncommut- 
ing, and this is the reason a\,ai can be viewed 
as (mutually exclusive) values of a single input. 

I7[7g| . 



The two out- 



and analogously for P\ 

puts are spin values recorded: having chosen axes 
cti and /3j, i,j € {1,2}, one records A^ for the 
first particle and B^ for the second, each be- 
ing a random variable with values "spin-up" and 
"spin-down." We consider the case when these 
two outcomes are equiprobable for both A^ and 
Bij-. Pr [A^ = up] = Pr [B^ — up] — 1 /2 for all 

i,je{l,2}. 

Formally equivalent situations are abundant in 
behavioral and social sciences ff| 13 1 ? , where 
the issue of selective influences was initially in- 
troduced in |i<g|iff| , in the context of informa- 
tion processing architectures. An example of 
a system here (from our laboratory) can be a 
human observer who adjusts a visual stimulus 
until it matches in appearance another visual 
stimulus, characterized by two properties, a and 
/3 (e.g., amplitudes of two Fourier-components), 
each varying on two levels, ct\, a 2 and j3\, @2- De- 
noting by S}j and Sfj the corresponding prop- 
erties (amplitudes) of the adjusted stimulus in 
response to Oj , j3j , we define a binary random 
output Aij as having the value "high" or "low" 
according as the variable Sfj is above or below 
the median of its distribution; output B^ is de- 
fined from Sfj analogously. Marginal selectiv- 
ity is ensured by construction: Pr [Aij = high] = 
Pr [B^ = high] = 1/2 for all i, j € {1, 2}. 

Another example is a population of people each 
of whom takes two exams (say, in French and 
in chemistry) and gets numerical scores S 1 and 
S 2 . Let each score be observed after one of two 
preparation times (ai = "short," a2 = "long" for 
5 1 , and analogously for S 2 ). We define a binary 
random variable Aij as having the value "pass" 
or "fail", according as the score £y is above or 
below the median of all scores for ai, fy; output 
Bij is defined analogously. 

In an example from a biological domain S}j and 
Sfj could be activity levels of two neurons tuned 
to two stimulus properties, a and f3, respectively. 
Making a and /? vary on two levels each and defin- 
ing A^ , B^ with respect to the medians of Sjj , Sfj 
by the same rule as above, we get precisely the 
same mathematical formulation. 



2. FORMS OF CONTEXT (DETERMIN- 
ISM) 

In the following, symbols i,j,k (possibly with 
primes) always take on values 1, 2 each. Equation 
(1.1 1 is equivalent to the existence of a jointly 



distributed system 

such that every observable pair A^ , B^ is dis- 
tributed as H\,H 2 ; in symbols, 



(//;.//;} - Li,,.//,,* 



This statement 
of) the Joi n t 
U\2b\21 



ease 



is known as (a special 
Distribut ion Criterion (JDC) 
Its link to jOj is given by B\\ = 
(R, a^, Hj = g {R, (3j). JDC is a deep criterion 
that provides a probabilistic foundation for our 
understanding of the classical (non)contextuality 
(or determinism) in physics. 

Denoting by a\, a 2 (and b\, 62) the two possible 
values for Ay (resp., Bij), let 

Pij = Pr [A^ = ax, B,^ = 61] . 

JDC in this case is equivalent to four double- 
inequalities 

1 - C 1 + C fn _ , 

— — < p^ +pij, +Pi'j> -Pi>j < — - — • (2.1) 



with i ^ i', j ^ j' and C = 1. \^8\. They are 
known as Bell/CHSH inequalities (in the homo- 
geneous form), CHSH acronymizing the authors 
of [|. The QM theory of the EPR/B paradigm 
predicts and experimental data confirm violations 

\22j23] , but QM 



of the Bell/CHSH inequalities 



■5p? : inequality (2.1 1 



imposes its own constraint 

with C = \pl. However, there is no analogue or 
extension of JDC equivalent to this constraint. In 
other (e.g., behavioral) applications, one cannot 
exclude a priori the possibility of C exceeding y/2, 
falling between 1 and v2, or being constrained by 
some value below 1. Moreover, one can conceive 
of 

m < p^ +Pij> +Pi>j> - Pi'j < M 

with M + m 7^ 1. It would be unsatisfactory if 
all these possibilities, whether or not empirically 
realizable, could not be treated within a unified 
probabilistic framework including JDC as a spe- 
cial case. We construct such a framework, based 
on the classical (Kolmogorov's) theory of proba- 
bility and the probabilistic coupling theory 25 . 
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3. CONNECTIONS 

It is easy to see that for any observable 
p = (pn,pi2,P2i,P22) one can find a jointly dis- 
tributed system of binary variables 

tt / rrl rr2 rrl tt2 rrl tt2 rrl tt2 \ 

n — V"1X> 11' 12' "12) -"21' -"21' "22' "22/ 

such that 



for all The JDC then amounts to additionally 
assuming that 

Pr [H^ ± H\ 2 \ = 0, Pr [fl* ^ fl* ] = 0, 

and this is the assumption that is rejected by QM 
in the EPR/B paradigm. This assumption, how- 
ever, is not the only way of thinking of H . Since 
An and A.- l2 (or By and B 2 j) occur under mu- 
tually exclusive conditions, there is no privileged 
pairing scheme for their realizations, whence zero 
values for Pr [H a ^ H} 2 ] , Pr [Hy ^ H^] are as 
acceptable a priori as any other. Our approach 
consists in replacing this assumption with 



Pr \H*. £ H% 



H G [0,1], 
: 2e) G [0, 1] , 



and characterizing the dependence of ( A, B) on 
(a, 0) by properties of the set of all 4-vectors 
e — (e\, e\, e\ , e 2 ) that are compatible with or im- 
ply certain constraints imposed on the observable 
vectors p = (pn,pi2,P2i,P22)- Having adopted 
a particular diagram of input-output correspon- 
dences (in our case, A <— a, B <— f3), we can also 
say that these sets of e characterize the contex- 
tual role of a, @ for B and A, respectively. We 
call e a connection vector. 

To distinguish our approach from other forms 
and meanings of probabilistic contextualism, e.g., 
26\27\28\ , we dub it the "all-possible-couplings" 
approach. The term "coupling" refers to impos- 
ing a joint distribution (say, that of H\ x , H\ 2 ) on 
random variables that otherwise are not jointly 
distributed (An and Ai 2 ). Although here we do 
not need to introduce it explicitly, in more com- 
plex settings the set of couplings is a construct 
separate from A and H (see Appendix A). 



4. EXTENDED LINEAR FEASIBILITY 
POLYTOPE (ELFP) 

We need this notion to investigate constraints 
imposed on the observable p. ELFP is the set 



of all possible (p, e) for which there exists jointly 
distributed Hy such that, for all i,j, 

[III. Ill) ~ (Aij, By) 
i.e. 

Pr [H^ = a 1 ,Hf J =h] = Pij 

(Hh,Hl 2 )~(A lU A l2 ) 
i.e. 

Pr[Hl 1 = a 1 ,H} 2 = a 2 ] = e] 

\Hlj,Hl-) ~ (By,B 2 j) 
i.e. 

Pr [H* j = b 1 ,Hi j =b 2 ]=e 2 j 

The existence of such an H means the existence 
of a probability vector Q consisting of the 2 8 joint 
probabilities 



Pr [Hi 



n 2 

> -"22 



h 2 1 

"•22J I 



hjj G {ai,a 2 }, hy G {bi,b 2 }. Let P denote the 
2 5 -componcnt vector consisting of 2 observable 
probabilities 

Pi [-^ij — ^ij , ^ij — 

and 2 4 connection probabilities 



Pr [An = a tl ,A l2 = a i2 ] , Pr [By 
aij G {oi,a 2 }, hj G {61,62} 



by,B 2j 



b 2j ] 



Define a 2 5 



2 8 Boolean matrix M 



whose rows are enumerated in accordance 
with components of P (i.e., by equalities 
[Aij = a,ij,Bij = bij], [An = an,A i2 = a i2 ], or 
[By = by, By — b 2 j\) and columns in accor- 
dance with components of Q (i.e., by equalities 
[H\ 1 =h\ 1 ,Hl 1 = h\ 1 ,...,H* 2 = hl 2 ]). _ An 
entry of M contains 1 if and only if the 
corresponding random variables in the enu- 
merations of its row and its column have the 
same values: e.g., if a row is enumerated 
by [B12 = 612, B 22 = b 22 ] and a column by 
[H{ } =h\ x ,..., H\ 2 = h 2 12 ,..., H\ 2 = h\ 2 \ , then 
their intersection contains 1 if and only if 
h 12 = b r2 , h 22 = b 22 . 

It is easy to see that H exists if and only if 

MQ = P 

for some vector Q > (componentwise) of prob- 
abilities. The vectors P for which such a Q ex- 
ists are exactly those within the polytope whose 
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Table 1: Characterizations of the sets of four different types (columns) subject to three constrains (rows). 





All (p, e ) 


Fit (e) 


Force (e) 


Equi (e) 


chaos 


€ ELFP 


maxSe < V 2 


e [0,11 4 


maxSe < V 2 


quant 


max Sip < 


maxSoe < 5= ^, 
maxSie < 1 /2 


max S £ > 


3-V2 c c - 
max Sie < x /2 


class 


maxSip < 1 /2 


max Sie < Y2 


1 6 Soe 


1 e S e 



vertices are the columns of the matrix M. The 
term ELFP is due to this construction extend- 
ing that of the linear feasibility test in 11 . This 
test, among other applications, is the most gen- 
eral way of extending the Bell/CHSH criterion to 
an arbitrary number of particles, spin axes, and 
spin quantum numbers 11\1S\29 31 



To describe ELFP by inequalities on (p, e), we 
introduce the 16-component sets 



3. Find the set Force cons t r of e G [0, 1/2] that 
force all observables p to satisfy constr: 

e G Force constr ((p,e) G ELFP=> constr (p)). 

4. Find the set Equi constr of e G [0, 1 /2\ i for 
which an observable p satisfies constr if and 
only if (p, e) is in the ELFP set: 



Sp = < 



± (p u - 1/4) ± (p 12 - y*) 

± (P21 - X /4) ± (P22 - X A) 

each ± is + or — 



£ £ Equi c 



(constr(p) e ELFP). 



Clearly, Equi c 



Force c 



. n Fit r 




Sop and Sip denote the subsets of Sp with, respec- 
tively, even (0,2, or 4) and odd (1 or 3) number 
of + signs; So£ and Si£ are defined analogously. 
ELFP is described by 

max (max Sop + max Sie, maxSip + max Soe) < 3 /2 

(4.1) 



To illustrate, we focus on the following four 
benchmark constraints. The no-constraint situ- 
ation is given by 

chaos(p) <^=> p g [0, !/2] 4 , 

equivalent to ( |2.1[ ) with C = 2. The quantum 
constraint is given by 

quant (p) maxSip < "^2/2, 



equivalent to (2.1 1 with C = y/2. The "classical" 



(see Appendix Bl I 



5. ALL-FIT-FORCE-EQUI SETS 
CONSTRAINTS ON OBSERVABLES 



AND 



constraints are given by 

class (p) -4=> max Sip < 1 /2, 

equivalent to the Bell/CHSH inequalities (C = 
1). Finally, we consider the constraint 



Let constr (p) denote any constraint (e.g., in- 
equalities) imposed on p. Our approach consists 
in characterizing this constraint by solving the 
following four problems: 

1. Find the set All constr of all (p, e) G [0, !/2] 8 
with p subject to constr (p): 

(p, e) G All constr ^> (constr(p) and (p, e) G ELFP) 

2. Find the set Fit cons t r of connection vectors 
e G [0, !/2] 4 that fit (are compatible with) 
all observables satisfying constr: 

e G Fit const <^ (constr(p) => (p,e) G ELFP). 



fix(p) 



p = specific vector. 



For all constraints except for fix (p) the sets All, 
Fit, Force, and Equi are as sh own in Table 1 (for 
derivations see Appendix B2). 
Thus, 



Fit, 



clia 



is the set of all e such that 
maxSe < 1 /2: if an e is in this set, then any p 
(no constraints) is compatible with it. Force quan t 
is characterized by max Soe > if an e is 

in this set, then all compatible with it p satisfy 
quant (p). Equi c j ass is the set of all e such that Soe 
contains 1: for any such an e, a p is compatible 
with it if and only if it satisfies class (p). 
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Figure 1: Left: Profiles Vol 8 (All constr ) -> Vol 4 (Fit constr ) -» Vol 4 (Force COIls tr) ->■ Vol 4 (Equi constr ) for 
constraints chaos, quant, and class. Right: Vol 4 (Fitfi x ( p )) as a function of x = max Sop and y = 
maxSip. The possible (x,y) -pairs form the triangle ((0, 0), (1/2, 1), (1, V 2 ))> and Vol 4 (Fit flx(p) ) = 1 + 
^ (-1 + 8x - 24a; 2 + 32a; 3 - 16a; 4 ) + ^ (-1 + 8y - 24y 2 + 32y 3 - 16y 4 ), where p(z) = 1 if z > J/a and 
p (z) — otherwise. 



For each of these sets we compute Vol d , its vol- 
ume normalized by that of [0, 1 /2] d , with d be- 
ing the dimensionality of the set (Fig. 1, left). 
Thus, the defining property of Force c i ass , 1 £ SqS, 
is satisfied if and only if either all e\ are 0, or 
they all are 1 /2, or two of them are and two 
1/2. Hence Vol 4 (Force c i ass ) = 0. For nonzero 
volumes, the derivation is described in Appendix 
|B2| Each panel of Fig. 1, left, can be viewed 
as a "profile" of the corresponding constraint. 
Each of the first three volumes in a panel can be 
viewed as characterizing the "strictness" of con- 
straint, in three different meanings. The intuition 
of a stricter constraint is that it corresponds to 
a smaller Vol 8 (All constr ), larger Vol 4 (Fit constr ), 
and smaller Vol 4 (Force con str)- 

The constraint fix (p) has to be handled sepa- 
rately. Clearly, Vol 8 (All fix(p) ) = 0. Fit fix(p) is 
described by 



maxSie < 3 / 2 — maxSoP, 
maxSoe < 3 /2 — maxSip, 



and Vol (Fitfi x ( p )) is a polynomial function of 
maxSoP and maxSip, these two quantities form- 
ing the triangle ((0, 0), (1/2, 1), (1, V 2 ))- The P ol y- 
nomial and its values are shown in Fig. 1, right 
(see Appendix |B3| for computational details). 
Forcefi x (p) is clearly empty, hence so is Equi fix ( p ) . 



6. CONCLUSION 

At the cost of greater computational complex- 
ity but with no conceptual complications the 
computations involved in this demonstration of 
the all-possible-couplings approach to probabilis- 
tic contextuality can be extended to more gen- 
eral cases: arbitrary marginal probabilities, non- 
linear constraints, and greater numbers of inputs, 
outputs, and their possible values. The language 
for a completely general theory, involving unre- 
stricted (not necessarily finite) sets of inputs, out- 
puts, and their values, is presented in Appendix 
A. 
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APPENDIX A: All-possible-couplings ap- 
proach on the general level 

We show here how the approach presented in 
the main text generalizes to arbitrary sets of in- 
puts and random outputs. We use the term se- 
quence to refer to any indexed family (a func- 
tion from an index set into a set), with index 
sets not necessarily countable. We present se- 



(> 
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quences in the form (x v : y EY), (x z : z G Z), or 
[x\ : y £ Y,z £ Z). A random variable is under- 
stood most broadly, as a measurable mapping be- 
tween any two probability spaces. In particular, 
any sequence of jointly distributed random vari- 
ables is a random variable. For brevity, we omit 
an explicit presentation of probability spaces and 
distributions. In all other respects the notation 
and terminology closely follow 75| 

An input is a set of elements called input val- 
ues. Let a = (a k : k £ be a sequence of in- 
puts. A treatment is a sequence = (x k : k £ K) 
that belongs to a nonempty set $ C Ilfceif ak ( so 
that x k £ a k for all k £ K). If G fe G K, and 
I C K, then (k) = x k £ a k and 0| I is the re- 
striction of to /, i.e., the sequence (x k : k £ I). 

An output is a random variable. Let 

(a 1 ^ : k £ K,(f> £ $j be a sequence of outputs 
such that 



1. A^ = yA^ : k £ K j is a random variable 

for every £ $, i.e., the random variables 
A k ^ across all possible k possess a joint dis- 
tribution; 

2. if 0,0' G I C K, and 0|J = 0'|I, then 
(A* : fc G I) ~ (a*, : * G l) 



Property 2 is (complete) marginal selectivity [£|. 
A^ is called an observable (random variable), and 
A = (Ad, : € $) is the sequence of observables. 

Remark A.l. The interpretation is that for every 
0, each a fc may "directly" influence A*; but no 
other output in A^. The fact that inputs in a = 
(a k : k £ AT) and outputs in an observable A^ = 

(yA 1 ^ : k £ K^j are in a bijective correspondence is 

not restrictive: this can always be achieved by an 
appropriate grouping of inputs and (re) definition 
of treatments 



Remark A. 2. The case considered in the main 
text corresponds to K = {1, 2}, 



a = (a , a 2 ) 



with a k = {a^o^} for k£ {1,2}, 



$ = {011, 012, 021, 022} 

with (j>ij = (a}, a 2 ) for i,j £ {1, 2} , 
and (abbreviating Ad, i} as A*j and A^.. as AjA 

A= (Aii,Ai2,A 2 i,A 2 2), 
with A 2J = (Af, A 2 ) for i, j G {1, 2} , 



where each A k j is a binary random variable with 
Pr [A% = 4] = Pr [A* = a k ] = 1/2. 

Given a sequence of observables A = 
(Atfj : G $), a sequence of random variables 



K:ren^Ve2 K -{U} 

V fee/ / 



(not necessarily jointly distributed) is called a 
connecting set for A if each is a coupling for 

A^ = (A^:0e$,0|/ = r), 

where A^ = ( A^ : fc G /) . This means that 
is a random variable of the form 

C T / =(C T / :0e$,0|/ = r) 



with 



r 1 ~ /i 7 



for all G $ such that 0|7 = r. is 
called an (I ^t) -connection. The indexation 
in C^j is to ensure that if (I,t) 7^ (I',t'), 

then and C^, are stochastically unrelated. 
An identity (I , r) - connection C\ is one with 



Pr 



r 7 

T,<j>' 



£ <E>. For- 



1 for any q 

mally, A itself can be viewed MS M connection C?, 
but it is preferable to keep A separate by not al- 
lowing 7 = 0, as A is the only observable part of 
the construction. 

Remark A. 3. It is generally convenient not 
to distinguish identically distributed connec- 
tions. By abuse of language, the distribution 
of Cf- (or some characterization thereof) can 
also be called (I, r) -connection. We use this 
language in the main text when we represent 
({fc} , k 1— > <xf) -connections (without introducing 
them explicitly) by probabilities e k and call e a 
connection vector. See Remark lA.41 

A jointly distributed sequence 

H=(H k :k£K,qh£$>) 

is called an Extended Joint Distribution Sequence 
(EJDS) for (A, C A ) if for any I £ 2 K - {0, K} 
and any r G H k€l a k , 

ff^= (i^:0G$,0|/ = "r) ~C^, 
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where = \ H§ : k G l\ , and 

H£ = (H*:keK) ~ A 

for any </> G 

Remark A. 4. For the case considered in the main 
text, a connecting set for ^4 is (conveniently re- 
placing Cg } , C£ } | {1} , and Cjj |{2} with C§ , 
C}, and C|, respectively) 

C A = {(%,(%,(%,(%) 

with Cj 1 = (pla, C ijj2 ) and C| = (pj,ij > Cj,2j) > 
such that 

cf^-^ :•„, ;../. {1,2}. 

An EJDS for (A, CU) is a random variable (using 
analogous abbreviations) 

TT / Tjl Tj2 Tjl Tj2 Tjl Tj2 Tjl Tj2 \ 

such that 

( H ln H i2) ~ Q 1 ; ( H iji H 2j) ~ 

and 

2T# = ~ Ay = (4,4.) for i, j G {1, 

In the main text each Cf is represented by e\ and 
each H}f by p ly 

An EJDS for (A, Ca) reduces to the Joint Dis- 
tribution Criterion set (JDC set) of the theory 
of selective influences | if a U connections in 

Ca are identity ones. Note that no connection 
has an empirical meaning: for distinct (f>, cf>' G 
the variables A 1 ^ and A 1 !, corresponding to C 1 . ^ 
and C\ 0, do not have an observable (or theoret- 
ically privileged) pairing scheme. 

Let X be any set whose elements are sequences 
of observables A = (A^ : (f> G $). X can be 
viewed as the set of all possible observables satis- 
fying certain constraints. We define the sets Allx, 
Fitx, Force^, and Equix as follows: 

1. Allx is the set of all pairs (A, Ca) such that 
Ae X and there exists an EJDS H for (A, Ca) ■ 

2. Fitx is the set of all Ca such that 

Ae X => there exists an EJDS H for (A, Ca) ■ 



3. Forcex is the set of all Ca such that 
there exists an EJDS H for (A, C A )=^ Ae X. 

4. Equi x = Forcex H Fitx, that is, Ca G 
Equix if and only if 

Ae X there exists an EJDS H for (A, C A ) ■ 

The all-possible-couplings approach in the gen- 
eral case consists in characterizing any X (inter- 
preted type of contextuality or determin- 
ism) by Allx, Fitx, Forcex, and Equix- A 
straightforward generalization of this approach 
that might be useful in some applications is to 
replace Ca in all definitions with a subset of Ca , 
or several subsets of Ca tried in turn. Thus one 
might consider connections involving only partic- 
ular I C K (e.g., only singletons), or one might 
require that some of the connections are identity 
ones. 



APPENDIX B: Computational Details 

Bl. Computations for ELFP 

A convex bounded polytope can be equiva- 
lcntly defined either as the convex hull of a set of 
points (V-representation) or as the intersection 
of half-spaces (H-representation) . For our pur- 
poses, a V-representation of a convex polytope in 
d-space is given by a set of points x\, . . . , x n G K d . 
The polytope consists of all convex combina- 
tions of these points: XiX\ + ■ ■ • + X n x n , for all 
Ai, . . ., A n > 0, Ai + • ■ ■ + A„ = 1. It is possible 
that the polytope is of lower dimension than the 
space M. d in which it is defined if all the points Xj 
reside in a lower dimensional affine subspace of 
K d . A minimal V-representation (including only 
extreme points, i.e., points that are vertices of the 
polytope) is unique. The H-representation of a 
convex polytope is given by vectors a\, . . . , a m G 
]R d and a vector b G M. m . The polytope consists 
of the points x G R d satisfying afx < bi for all 
i = 1, . . . , m. A lower-dimensional convex poly- 
tope can be represented by including inequalities 
of the forms a T x < b and (— a) T x < ~b for some 
a and b or by explicitly specifying certain con- 
straints as equations in the representation. For 
a full-dimensional convex polytope, the minimal 
H-representation is unique. However, for a lower- 
dimensional polytope, the equation constraints 
can be specified in many equivalent ways and the 
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inequality constraints can look different depend- 
ing on which of the linearly related coordinates 
are used to specify them. 

There exist algorithms for converting be- 
tween the two representations of a convex 
poly tope in exact rational arithmetic. We 
have used our own program for these con- 
versions but other programs, such as Irs 
(http:/ /cgm.cs.mcgill.ca/~avis/C/lrs.html), can 
do the same. The conversion between the 
two representation is computationally demand- 
ing, the algorithms generally requiring superpoly- 
nomial time in the size of the input. 

A computationally simpler problem is elimi- 
nating redundant points (those that are not ver- 
tices of the polytope) from a V-representation or 
eliminating redundant equations or inequalities 
from an H-representation. This problem can be 
solved by linear programming and the algorithm 
is implemented in the redund program that comes 
with Irs. However, the redund program is not suf- 
ficient for putting an H-representation to a mini- 
mal form as it cannot convert sets of inequalities 
into equivalent equations (e.g., the three inequal- 
ities x>0,y>0,x + y<0 should be minimally 
represented as the two equations x = 0, y = 0). 
To find the minimal H-representation, for every 
constraint afx < bi or afx = bi in turn, one 
can find the upper and lower bounds u and I by 
maximizing and minimizing the expression afx 
given the other constraints, and apply the follow- 
ing rules: 

1. if this is an equation constraint (i.e., afx — 

and u = I = bi, then the constraint is 
redundant and can be eliminated; 

2. if this is an inequality constraint (i.e., 
afx < bi) and u < bi, then the inequality 
is redundant and can be eliminated. Oth- 
erwise, if I = bi, then the constraint should 
be converted to an equation. 

The dimension of a polytope can be determined 
from a minimal H-representation. It is the di- 
mension of the space minus the number of equa- 
tion constraints in the minimal representation. 
Given a full-dimensional polytope, its volume 
can be computed using the Irs program along- 
side the conversion from a V-representation to 
an H-representation. If the polytope is given 
as an H-representation, then it has to be con- 
verted to a V-representation first to compute its 
volume using Irs. To compute the volume of 
a lower-dimensional polytope, we first move to 



a lower-dimensional parameterization that spans 
the affine subspace where the polytope resides. 

To compute ELFP, we begin by formulating 
the linear programming problem MQ — P sub- 
ject to Q > 0, as described in the main text 
(M being 2 5 x 2 8 , P having 2 5 components). 
M defines the V-representation for ELFP, and 
Vol 8 for ELFP is computed directly from it. Ap- 
plying an algorithm to find an equivalent H- 
representation we obtain a system of 160 inequali- 
ties and 16 equations. We can then substitute the 
expressions in the above matrices into this sys- 
tem and reduce any redundant inequalities and 
equations. The resulting system has 144 nonre- 
dundant inequalities and no equations with the 
Pn,Pi2,P2i,P22,£i,e2) £ i) £: 2 variables. Then we 
algebraically simplify the list of 144 inequalities, 
first into 



-r < -pu + p 2 i + P12 + P22 < i + r, 

~r < PU - P21 + Pl2 + P22 < i + r, 
~r < Pu + P21 - P\2 + P22 < i + r, 

-r < pu + pu + P12 - P22 < i + r, 



-A < pu + Pl\ + Pl2 + P22 < 2 + A, 



I - Pu - Pl\ + P\2 + P22I < 1 + A, 
I - Pu + P21 - P12 + P22I < 1 + A, 
I - Pu + P21 + P12 - P22I < 1 + A, 

where 

T = min{ 1 — e\ — e\ + e\ + e\, 
1 — e\ + e\ — e\ + e\, 
1 — e\ + e\ -\- e\ — e^, 
1 + e\-e\- sl+el, 
l + e\-e\ + e\-el, 

1 + e\+e\-e\-el, 
e\+e\+e\+el, 

2 — £j — e 1 — e 2 — £ 2 }) 
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A = min{ -e\ + e\ 



■A 



el 



el 



'2- 



£1 £-1 So 



e\ 



e\ 



el 



e\ 



■4 

el 



£ l e 2 £21 

el}, 



e\ 



el 



and then, by noticing regularities, into the com- 
pact inequality (4.1). 



Remark. Changing e*. — > 1 /2 — e* leads to (denot- 
ing the new e-vector by e') 

maxSie' = maxSoe, maxSoe' = maxSie. 

Analogously for py — > 1/2 — py , 

maxSip' = maxSop, maxSop' = maxSip. 

It follows that we cannot without loss of gener- 
ality confine all components of s or p to [0, 1/4]. 
But ELFP does not change if the transformation 
x — > 1 /2 — x is applied to an even number of the 
components of (p, e). 



B2. Computations for chaos(p), quant(p), 
and class (p) constraints 

The All cons t r polytopes for the three con- 
straints are obtained by concatenating the ELFP 
equations and inequalities with the constraint in- 
equalities. Then, the volumes are computed by 
using the Irs program as described above. 

For Fit constr polytopes, we observe first that 
they are convex. This follows from 



Fit, 



{e : Vi = 1, . 



(P(i),e) E ELFP } 



= ELFP P(1) n • • • n ELFP PM , 



where pu), i = l,...,n, denote the vertices of 
the 4D convex polytope defined by constr and 
ELFP P(i) denotes the (convex) cross-section of 
the ELFP set formed with p = pny It follows 
that Fit cons tr is convex as the intersection of con- 
vex sets. Following the logic of this observation, 
we have implemented a general program for elimi- 
nating variables from a system of linear equations 
and inequalities so that the resulting system is 



satisfied for exactly those values for which there 
exist such values of the eliminated variables for 
which the original system is satisfied. This pro- 
gram together with steps to ensure that the re- 
sulting representation is minimal was used to find 
all the Fit sets shown in the main text. 

Finding the forcing sets is more difficult as they 
are generally not convex. We characterize them 
using the equation 

Force cna0 s Force CO nstr 

= {e : (Bp : (p, e) G ELFP A ^constr (p))} . 

This equation provides an algorithm: for each in- 
equality in constr, form the conjunction of the 
ELFP inequalities with the negation of the in- 
equality. Then project this conjunction to the 
e 4-space. The union of these projections over 
all inequalities in constr is the set Force cria os — 
Force cons t r - We have implemented a general pro- 
gram that takes as input a representation of a 
polytope, a list of additional constraints, and a 
list of variables to eliminate. It then outputs a 
representation of the difference of the polytope 
and the set represented by the additional con- 
straints projected to the remaining (not elimi- 
nated) variables. This representation consists of 
a list of linear systems whose disjunction char- 
acterizes the resulting set. In all our computa- 
tions it turned out that all the linear systems in 
the disjunction were the same, and so the sets 
Force cna os — Force cons t r are in fact convex in these 
cases. 

The computations of Equi sets require no elab- 
oration. 

Remark. There is the practical problem that the 
negation of a < -inequality is a > -inequality 
while standard algorithms only accept closed con- 
vex polytopes. To cope with this problem, we ap- 
proximated a > b by a > 6+ (very small number). 
We also used a rational approximation to %/2 in 
the quant constraints. In both cases, we have re- 
peated the computations with decreasing values 
of "very small number" until it was obvious where 
the results converged. 



B3. Computations for FitfWp) constraint 

That maxSoP and maxSip are con- 
tained in and completely fill the triangle 
{(0,0), (1/2, 1), (1,1/2)} can be verified by split- 
ting ( |5.1[ ) into 64 component cases according 
as which of the values of SqP and Sip are the 
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maxima, finding the vertices of each component 
system, and drawing the union of these compo- 
nents in maxSoP and maxSip coordinates. The 
triangle is described by 

2 max Sqp — max S±p > 0, 
2 max Sip — maxSop > 0, 
max Sqp + max S±p < 3 / 2 - 

Adding these inequalities to the represen- 
tation of (5.1) as linear inequalities ac- 
cording to the definitions of maxSoe and 
maxSie, we obtain a 6D polytope P' 6 - 1 in 
(e, max Sop, max Sip) -coordinates. In the V- 
representation of P^ , all vertices have values of 
maxSoP and maxSip in the set 

{(o,o),(y4,y 2 ),(y 2 ,y 4 ),(y 2 ,i), (1,1/2)}. 

It follows that every edge of the polytope 
projects to one of these 5 points or to a 
line connecting two of them. Consequently, 
as (maxSoe, maxSie) changes within any trian- 
gle T formed by these lines, the cross-section 



P, 



(4) 



c c , of p( 6 ) retains its structure (face 
lattice) while its coordinates change as affine 
functions of (maxSo£, maxSie) € T. It follows 



that the volume of Pj 4 ^ c c , is a polyno- 
mial of (maxSo£,maxSie) £ T of at most de- 
gree four. The coefficients of these polynomials 
were obtained by fitting unconstrained degree 4 
polynomials to the exact volumes Vol 4 (Fitfi x ( p )) 
for (maxS p, maxSip) G {0, .01, .02, . . . , l} 2 . It 
turns out that the coefficients change only if ei- 
ther of the differences max S$p— 1 / 2 and max S\p— 
x ji changes its sign. In all cases the fit is perfect 
for the number of points far exceeding the number 
of coefficients, confirming that the computations 
are correct. 
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